Automated Template Discovery for Information Extraction from Biomedical Literature
Authors
Abstract
We propose a method to automatically extract templates from biomedical literature without background knowledge. Using a large dictionary called an extensional ontology, the method automatically extracts verbs and templates that indicate interactions between biomolecules. We applied our method to two datasets: one comprised 299 full texts from Cell (1998–2002) and 13,818 entries from OMIM (Online Mendelian Inheritance in Man); the other included 33,622 abstracts from Medline (2002). Experimental results showed that our method could extract verbs and templates that had been manually collected in related works. To extract templates, our method requires only an ontology (or dictionary) and a large body of text. Consequently, it can be applied to the literature of other fields as well as to biomedical literature.
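As a rough illustration of dictionary-driven template discovery, and not the authors' actual algorithm, the Python sketch below marks dictionary terms in each sentence and counts the word sequences found between two adjacent terms. The function name `extract_templates`, the `max_gap_words` cutoff, the `X ... Y` template notation, and the toy sentences and dictionary are all assumptions made for this example.

```python
import re
from collections import Counter


def extract_templates(sentences, dictionary, max_gap_words=5):
    """Count the word sequences that occur between two dictionary terms.

    Hypothetical sketch: the abstract does not describe the paper's
    matching or scoring procedure, so this is a simple stand-in.
    """
    counts = Counter()
    if not dictionary:
        return counts
    # Try longer terms first so multi-word names win over their substrings.
    terms = sorted(dictionary, key=len, reverse=True)
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(t) for t in terms) + r")\b"
    )
    for sentence in sentences:
        matches = list(pattern.finditer(sentence))
        # Look at each pair of adjacent term mentions in the sentence.
        for left, right in zip(matches, matches[1:]):
            gap = sentence[left.end():right.start()].strip()
            if gap and len(gap.split()) <= max_gap_words:
                counts["X " + gap + " Y"] += 1
    return counts


# Toy input, invented for illustration only.
sentences = [
    "RAD51 interacts with BRCA2 in vivo.",
    "p53 interacts with MDM2.",
    "MDM2 binds to p53 directly.",
]
dictionary = {"RAD51", "BRCA2", "p53", "MDM2"}
templates = extract_templates(sentences, dictionary)
```

In this toy run the most frequent template is `X interacts with Y`. Ranking candidate templates by frequency over a large corpus is one plausible way to surface interaction verbs without hand-written rules, which is the kind of result the abstract reports.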
Similar resources
Survey on Perception of People Regarding Utilization of Computer Science & Information Technology in Manipulation of Big Data, Disease Detection & Drug Discovery
This research explores the manipulation of biomedical big data and disease detection using automated computing mechanisms. Because an efficient and cost-effective way to discover diseases and drugs is important for society, a computer-aided automated system is a must. This paper aims to understand the importance of computer-aided automated systems among the people. The analysis result from collected da...
LitLinker: A System for Searching Potential Discoveries in Biomedical Literature
The explosive growth in biomedical literature has made it difficult for researchers to keep up with advancements, even in their own narrow specializations. While researchers formulate new hypotheses to test, it is very important for them to identify connections to their work from other parts of the literature. However, the current volume of information has become a great barrier for this task, ...
Semi-Automated Semantic Annotation of the Biomedical Literature
Semantic annotations are a core enabler for efficient retrieval of relevant information in the life sciences as well as in other disciplines. The biomedical literature is a major source of knowledge, which however is underutilized due to the lack of rich annotations that would allow automated knowledge discovery. We briefly describe the results of the SASEBio project (Semi Automated Semantic Enric...
Using statistical and knowledge-based approaches for literature-based discovery
The explosive growth in biomedical literature has made it difficult for researchers to keep up with advancements, even in their own narrow specializations. While researchers formulate new hypotheses to test, it is very important for them to identify connections to their work from other parts of the literature. However, the current volume of information has become a great barrier for this task a...
Enhancing a biomedical information extraction system with dictionary mining and context disambiguation
Journals and conference proceedings represent the dominant mechanisms for reporting new biomedical results. The unstructured nature of such publications makes it difficult to utilize data mining or automated knowledge discovery techniques. Annotation (or markup) of these unstructured documents represents the first step in making these documents machine-analyzable. Often, however, the use of sim...